(N 

OS 



O 



APPEARED IN BULLETIN OF THE 
AMERICAN MATHEMATICAL SOCIETY 
Volume 27, Number 2, October 1992, Pages 217-238 



PREVALENCE: A TRANSLATION-INVARIANT "ALMOST 

EVERY" 



OS . ON INFINITE-DIMENSIONAL SPACES 

i— I 

•«— > 
O 



BRIAN R. HUNT, TIM SAUER, AND JAMES A. YORKE 



Abstract. We present a measure-theoretic condition for a property to hold 
"almost everywhere" on an infinite-dimensional vector space, with particular 
^5^ ■ emphasis on function spaces such as C k and L p . Like the concept of "Lebcsgue 

UL ■ almost every" on finite-dimensional spaces, our notion of "prevalence" is trans- 

lation invariant. Instead of using a specific measure on the entire space, we 
define prevalence in terms of the class of all probability measures with corn- 
ed , pact support. Prevalence is a more appropriate condition than the topological 

concepts of "open and dense" or "generic" when one desires a probabilistic 
result on the likelihood of a given property on a function space. We give sev- 
eral examples of properties which hold "almost everywhere" in the sense of 
prevalence. For instance, we prove that almost every C 1 map on R n has the 
^. ■ property that all of its periodic orbits are hyperbolic. 

o' 
cs 

O \ 1. Introduction 

£N| . Under what conditions should it be said that a given property on an infmite- 

OS ' dimensional vector space is virtually certain to hold? For example, how are state- 

^ . ments of the following type made mathematically precise? 

(1) Almost every function /: [0, 1] — ► R in L 1 satisfies J Q f(x)dx ^ 0. 

(2) Almost every sequence {di}°°^ 1 in I 2 has the property that Y^iLi a i diverges. 

(3) Almost every C 1 function /: R — > R has the property that f'(x) ^ whenever 
f(x) = 0. 

k> , (4) Almost every continuous function /: [0, 1] — > R is nowhere differentiable. 

$H ' (5) If A is a compact subset of R" of box-counting dimension d, then for 1 < 

k < oo, almost every C k function /: R" — > R m is one-to-one on A, provided that 
m > 2d. (When A is a C 1 manifold, the conclusion can be strengthened to say that 
almost every / is an embedding.) 

(6) If A is a compact subset of R™ of Hausdorff dimension d, then for 1 < k < 
oo, almost every C k function /: W 1 — ► R m has the property that the Hausdorff 
dimension of f(A) is d, provided that m > d. 
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(7) For 1 < k < oo, almost every C k map on M. n has the property that all of its 
fixed points are hyperbolic (and further, that its periodic points of all periods are 
hyperbolic). 

(8) For 4 < k < oo, almost every C k one-parameter family of vector fields on K 2 
has the property that as the parameter is varied, every Andronov-Hopf bifurcation 
which occurs is "typical" (in a sense to be made precise later). 

In R rl , there is a generally accepted definition of "almost every", which is that 
the set of exceptions has Lebesgue measure zero. The above statements require 
a notion of "almost every" in infinite-dimensional spaces. We will be concerned 
mainly with function spaces such as L p for 1 < p < oo and C k for (integers) 
< A; < oo on subsets of R"; many of these spaces are Banach spaces, and all have 
a complete metric. The following are some properties of "Lebesgue measure zero" 
and "Lebesgue almost every" which we would like to preserve on these spaces. 

(i) A measure zero set has no interior ("almost every" implies dense), 
(ii) Every subset of a measure zero set also has measure zero, 
(iii) A countable union of measure zero sets also has measure zero, 
(iv) Every translate of a measure zero set also has measure zero. 

One could define "almost every" on a given function space in terms of a specific 
measure. For example, the Wiener measure on the continuous functions is appro- 
priate for some problems. However, the notion of "almost every" with respect to 
such a measure violates property (iv) . The following paragraph illustrates some of 
the difficulties involved in defining an analogue of Lebesgue measure on function 
spaces. We assume all measures to be defined (at least) on the Borel sets of the 
space. 

In an infinite-dimensional, separable 1 Banach space, every translation-invariant 
measure which is not identically zero has the property that all open sets have 
infinite measure. To see this, suppose that for some e, the open ball of radius e 
has finite measure. Because the space is infinite dimensional, one can construct an 
infinite sequence of disjoint open balls of radius e/4 which are contained in the e- 
ball. Each of these balls has the same measure, and since the sum of their measures 
is finite, the e/4-balls must have measure 0. Since the space is separable, it can 
be covered with a countable collection of e/4-balls, and thus the whole space must 
have measure 0. (Even if the space were not separable, we would be left with the 
undesirable property that some open sets have measure zero, violating property (i) 
above.) 

In the absence of a reasonable translation-invariant measure on a given function 
space, one might hope there is a measure which at least satisfies condition (iv) 
above; such a measure is called quasi-invariant . In IR™, there are an abundance of 
finite measures which are quasi-invariant, such as Gaussian measure. However, for 
an infinite-dimensional, locally convex topological vector space, a er-finite, 2 quasi- 
invariant measure defined on the Borel sets must be identically zero p, pll p3 (see 
also pp. 138-143 of pi). 



By separable we mean that the space has a countable dense subset. 
2 By a -finite we mean that the entire space can be expressed as a countable union of sets of 
finite measure. This rules out measures such as "counting measure" , which assigns to each set its 
cardinality. 
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Rather than search for a partial analogue of Lebesgue measure on function 
spaces, our strategy is to find an alternate characterization of the concepts of 
"Lebesgue measure zero" and "Lebesgue almost every" which has a natural exten- 
sion to function spaces. Properties (i)-(iv) alone do not uniquely determine these 
concepts, but there is a more subtle property which does. In M. n , let us consider 
the class of "probability measures with compact support", that is, those measures 
/i for which there exists a compact set T C M™ such that fi(T) = /i(R n ) = 1. 

(v) Let S be a Borel set. If there exists a probability measure \x with compact 
support such that every translate of S has /x-measure zero, then S has 
Lebesgue measure zero. 

Property (v) is proved in §g (see Fact [|) by a simple application of the Tonelli 
theorem (a variant of the Fubini theorem Q]). Notice that conversely, if S C M" 
has Lebesgue measure zero, then the hypothesis of property (v) is satisfied with /i 
equal to (for instance) the uniform probability measure on the unit ball. 

Given a probability measure jjl with compact support, we can define a trans- 
lation-invariant measure jj, on Borel sets S by fl(S) = if every translate of S has 
/i-measure zero and jl{S) — oo otherwise. What property (v) above says is that 
every such measure fi is greater than or equal to Lebesgue measure on the Borel sets 
of W 1 . Thus one way to show that a Borel set is small, in a translation-invariant 
probabilistic sense, is to show that p,(S) = for some \x. Such a strategy is plausible 
on infinite-dimensional spaces because it is not hard to find probability measures 
with compact support (for example, uniform measure on a line segment, or on the 
unit ball of any finite-dimensional subspace). 

In general, we will call a Borel set "shy" if jl(S) = for some probability measure 
fi with compact support, and we call any other set shy if it is contained in a shy Borel 
set (just as every Lebesgue measure zero set is contained in a Lebesgue measure 
zero Borel set). We then define a "prevalent" set to be a set whose complement is 
shy. This definition may not characterize all sets for which the label "almost every" 
is appropriate; our claim is rather that properties which hold on prevalent sets are 
accurately described as holding "almost everywhere" . 

In the absence of a probabilistic notion of "almost every" , statements such as 1-8 
above have often been formulated in terms of the topological notion of "genericity" . 
In this terminology, a property on a complete metric space is said to be generic if the 
set on which it holds is residual, meaning that it contains a countable intersection of 
open dense sets. 3 The complement of a residual set is said to be of the first category; 
equivalently, a first category set is a countable union of nowhere dense sets. The 
notion of "first category" was introduced by Baire in 1899, and his category theorem 
ensures that a residual subset of a complete metric space is nonempty, and in fact 
dense [§0). 

The concepts of "first category" and "generic" have formal similarities to "mea- 
sure zero" and "almost every" , satisfying a set of properties analogous to (i)-(iv) 
above. They also agree for some sets in R n ; for example, the set of rational numbers 
has measure zero and is of the first category. But perhaps too much emphasis has 
been placed on those examples in which first category sets happen to have measure 



3 Many authors require a residual set to be (not just contain) a countable intersection of open 
dense sets. Our terminology follows |20|. 
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zero. Sets which are open and dense in R™ can have arbitrarily small Lebesgue 
measure, and residual sets can have measure zero. 

In fact, many properties are known to be topologically generic in R n but have 
low probability. While the reader may be able to provide examples from his or her 
own experience, we include some for completeness. 

Example 1. For n > 1 let U n = {x e [0,1] : < 2 n x (mod 1) < 2-"}. Notice 
that V m = Un>m Un 1S open and dense but has measure less than 2~ m . Hence 
generically points in [0, 1] satisfy < 2 n x (mod 1) < 2~ n for infinitely many values 
of n, but the set of such points (P| m >i Vm) has measure zero. A similar construction 
arises naturally in |ll| . 

Example 2. Here we consider how well real numbers can be approximated by 
rationals. The Liouville numbers are the real numbers A which have the property 
that for all c, n > there exist integers p and q > such that 



X- P - 

q 



c 

< — 

q n 



As in the previous example, the set of Liouville numbers is residual but has Lebesgue 
measure zero p0j . In contrast are the Diophantine numbers, real numbers /i which 
have the property that for every e > there exists a c > such that for all integers 
p and q > 0, 

P 



A* 

q 



> 



q 2+e- 



The set of Diophantine numbers is of the first category but has full Lebesgue mea- 
sure in every interval. 

Example 3. Arnold studied the family of diffeomorphisms on a circle 

fui,s{x) = x + ui + esinx (mod 2w), 

where < u> < 2ir and < s < 1 are parameters. For each e we can define the set 

S £ — {uj e [0, 2n] : / WjE has a stable periodic orbit}. 

For < e < 1, the set S e is a countable union of disjoint open intervals (one for 
each rational rotation number), and is an open dense subset of [0,27r]. However, 
the Lebesgue measure of S e approaches zero as e — * 0. For small e, the probability 
of picking an u> in this open dense set is very small. See pp. 108-109 of Q for more 
details. 

Example 4. Consider the dynamics of an analytic map in the complex plane near 
a neutral fixed point. Suppose the fixed point is the origin; then the map can be 
written in the form 

z^e 27ria z + z 2 f(z) 

with < a < 1 and f(z) analytic. Siegel [[30] proved that for Lebesgue almost 
every a (specifically, if a is not a Liouville number), the above map is conjugate to 
a rotation in a neighborhood of the origin under an analytic change of coordinates. 
On the other hand, Cremer ||] previously showed that if / is a polynomial (not 
identically zero), then for a residual set of a the above map is not conjugate to 
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a rotation in any neighborhood of the origin. These results are discussed on pp. 
98-105 of §. 

Example 5. Consider the map z i— > e z on the complex plane. Misiurewicz [ fl6| 
proved that this map is "topologically transitive" , which implies that a residual 
set of initial conditions have dense trajectories. On the other hand, Lyubich [[13) 
and Rees [Q showed that Lebesgue almost every initial condition has a trajectory 
whose limit set lies on the real axis (in fact, the limit set is just the trajectory of 
0) . See H] for a discussion of both results. 

Example 6. For many families of dynamical systems in R 2 depending on a param- 
eter, Newhouse llq] and Robinson |25[ constructed a set of parameters for which 
infinitely many attractors coexist. The constructed set is residual in an interval, 
but is shown in [p3| and |p_9[] to have measure zero. 

In view of these examples, one might ask why the concept of "residual" is used. 
Sometimes, one just wants to show that a set obtained by a countable intersection 
is nonempty, or further that it is dense. For example, the existence of continuous 
but nowhere differentiable functions can be proved by showing that they form a 
residual subset of the continuous functions; this argument is due to Banach (see 
§111. 34. VIII of |1^| ). Other times, one wants to show that a set is "large" in a 
topological sense, perhaps because there has been no probabilistic alternative. The 
concept of "prevalence" is intended for situations where a probabilistic result is 
desired. 

In §g we formally define prevalence, shyness (the opposite of prevalence), and 
related concepts, and develop some of the basic theory of these notions. Section y 
examines the eight statements from the beginning of this section in the new frame- 
work. In §y we develop some of the theory of "transversality" (between functions 
and manifolds) in the context of prevalence, and use it to prove the third, seventh, 
and eighth statements. Finally, §|5j discusses some other ideas related to prevalence. 

2. Prevalence 

Let V be a complete metric linear space, by which we mean a vector space (real 
or complex) with a complete metric for which addition and scalar multiplication are 
continuous. When we speak of a measure on V we will always mean a nonnegative 
measure that is defined on the Borel sets of V and is not identically zero. We write 
S + v for the translate of a set S C V by a vector v EV. 

Definition 1. A measure /i is said to be transverse to a Borel set S C V if the 
following two conditions hold: 

(i) There exists a compact set U C V for which < /u(f/) < oo. 
(ii) n{S + v) = for every v E V. 

Condition (i) ensures that a transverse measure can always be restricted to a 
finite measure on a compact set (see Fact |2| below), and in developing the theory of 
transverse measures it is often useful to think in terms of probability measures with 
compact support. For applications it will be convenient to use measures which (like 
Lebesgue measure) are neither finite nor have compact support. If V is separable, 
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then all measures which take on a value other than and oo can be shown to satisfy 
condition (i) pjj . 

Definition 2. A Borel set S C V is called shy 4 if there exists a measure transverse 
to S. More generally, a subset of V is called shy if it is contained in a shy Borel 
set. The complement of a shy set is called a prevalent set. 

Strictly speaking, the above concepts could be called "translation shy" and 
"translation prevalent" . On manifolds for which there is no distinguished set of 
translations, the corresponding theory is more difficult; this is a topic we do not 
address in this paper. We again emphasize that the definitions of shy and prevalent 
would be unchanged if we required transverse measures to be probability measures 
with compact support. 

Roughly speaking, the less concentrated a measure is, the more sets it is trans- 
verse to. For instance, a point mass is transverse to only the empty set. Also, 
we will later show (see Fact ||) that if any measure is transverse to a Borel set 
S C R", then Lebesgue measure is transverse to S too. When V is infinite di- 
mensional, a convenient choice for a transverse measure is often Lebesgue measure 
supported on a finite-dimensional subspace. 5 For example, Lebesgue measure on 
the one-dimensional space spanned by a vector w £ V is transverse to a Borel set 
S C V if for all ueV, the set of A s R (or C if V is complex) for which v + Xw G S 
has Lebesgue measure zero. It immediately follows that every countable set in V 
is shy, and every proper subspace of V is shy. Notice that because it is possible to 
represent an infinite-dimensional space as the continuous linear image of a proper 
subspace, the continuous linear image of a shy set need not be shy. 

We now present some important facts about transversality and shyness. The 
first follows immediately from the above definitions, and in particular implies that 
prevalence is translation invariant. 

Fact 1. If S is shy, then so is every subset of S and every translate of S. 

Fact 2. Every shy Borel set S has a transverse measure which is finite with compact 
support. Furthermore, the support of this measure can be taken to have arbitrarily 
small diameter. 

Proof. Let /jbea measure transverse to a Borel set S C V. Then by condition (i) 
of Definition U] it can be restricted to a compact set U of finite and positive measure, 
and the restriction is certainly also transverse to S. Also, since U is compact it can 
be covered for each e > by a finite number of balls of radius e, and at least one 
of these balls must intersect U in a set of positive measure. The intersection of U 
with the closure of this ball is compact, and the restriction of fi to this set is also 
transverse to S. □ 

An immediate consequence of Fact is that a shy Borel set has no interior. The 
same is then true of every shy set, since every shy set is contained in a shy Borel 
set. Hence we have the following fact. 



4 The word "shy" was suggested to us by J. Milnor. 

5 An exact characterization of Lebesgue measure on a given finite-dimensional subspace depends 
on the choice of a basis for the subspace, but since we are only interested in whether or not sets 
have measure zero, the choice of basis is unimportant for our purposes. 
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Fact 2'. All prevalent sets are dense. 

Next, we would like to know that the union of two shy sets is also shy. Given 
Borel sets S,T C V containing the original sets and measures /i and v transverse 
to S and T respectively, we must then find a measure which is transverse to both 
S and T. We can assume by Fact @ that /j, and v are finite with compact support. 
Then the measure we desire is the convolution /i * v of /x and v, defined as follows. 

Definition 3. Let /i and v be measures on V. Let \l x v be the product measure 
of fi and v on the Cartesian product V x V, and for a given Borel set S C V let 
5 s = {(x, y) G V x V : x + y G S}. Then S s is a Borel subset of V x V, and we 
define /i * i>(S) = n x v(S E ). 

If // and v are finite, then /j, x 1/ is finite, and the characteristic function of S 11 is 
integrable with respect to p X !/. Then by the Fubini theorem GJ, 

fj,*v(S)= I fi(S — y) dv(y) = I v{S — x) d/u,(x). 
Jv Jv 

We thus have the following fact. 

Fact 3. Let /i and v be finite measures with compact support. If \i is transverse to 
a Borel set S, then so is /i * v. e 

From Fact y it follows that the union of two shy sets is shy, and more generally 
the following fact holds. 

Fact 3'. The union of a finite collection of shy sets is shy. 

Fact |] extends to countable unions by a slightly more complicated argument. 

Fact 3". The union of a countable collection of shy sets is shy. 

Proof. Given a countable collection of shy subsets of V, let Si, S2, ■ ■ ■ be shy Borel 
sets containing the original sets. Let fx%, [12, . ■ . be transverse to 5i, S2, ■ ■ ■ , respec- 
tively. By Fact g, we can assume without loss of generality that each /j, n is finite 
and supported on a compact set U n with diameter at most 2~". By normalizing 
and translating the measures, we can also assume that n n (V) = 1 for all n and 
that each U n contains the origin. With these assumptions we can define a measure 
H which is essentially the infinite convolution of the fx n . We rely on the theory of 
infinite product measures; see pp. 200-206 of B for details. 

The infinite Cartesian product U n = U\ x C/ 2 x ■ ■ • is compact by the Tychonoff 
theorem j| and has a product measure fi n — \i\ x /z 2 x ■ ■ • defined on its Borel 
subsets, with /i n ([/ n ) = 1. Since V is complete and each vector in U n lies at most 
2~™ away from zero, there is a continuous mapping from U u into V defined by 

(vi,V2, ■ ■ ■) 1— > v\ + V2 H ■ The image U of U n under this mapping is compact, 

and /j, n induces a measure /x supported on U, given by [i(S) — /i n (S' s ), where 
5 s = {(vi,V2, ■ ■ ■) G U n : vi + V2 + ■ ■ ■ G S}. We will be done if we show that /1 is 
transverse to every S n . 



6 Notice that fj, * v has compact support because its support is contained in the continuous 
image, under the mapping (x, y) ^ x + y, of the Cartesian product of the supports of fi and v. 
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Since the Cartesian product of measures is associative (and commutative), we 
can write /i n = fi n x i/^ with v^ — n\ x • • • x \i n -\ x /i„+i x • • • . Let v n be the 
measure on V induced by z/„ under the summation mapping (as fj, was induced 
by /x n ). Then /i = \i n * i/ n , and therefore by Fact pi fj, is transverse to S n . This 
completes the proof. □ 

We are now in a position to show that the conditions for shyness given in the be- 
ginning of this section can be weakened in some cases. First, consider the following 
definition. 

Definition 4. A measure \x is essentially transverse to a Borel set S C V if condi- 
tion (i) of Definition p] holds and n(S + v) = for a prevalent set of v G V. 

Though essential transversality is weaker than transversality, the following fact 
holds. 

Fact 4. If a Borel set S d V has an essentially transverse measure, then S is shy. 

Proof. Let /i be a measure that is essentially transverse to S. As in Fact m we may 
assume fi is finite with compact support. The set of all v G V for which /j,(S—v) > 
is shy, and hence is contained in a shy Borel set T. Let v be a finite measure with 
compact support which is transverse to T. Then for all w G V , 



(i * v(S + w) = / fi(S + w — y) dv{y) = 
Jv 

since the integrand is nonzero only on a subset of T + w and v(T + w) = 0. Thus 
// * v is transverse to <!?, and S is shy. □ 

Next let us examine a local definition of shyness and prevalence. 

Definition 5. A set S C V is locally shy if every point in the space V has a 
neighborhood whose intersection with S is shy. A set is locally prevalent if its 
complement is locally shy. 

Facts fy, |2j, and p] immediately hold also for local shyness and local prevalence, 
but whether Fact ^Jdoes is not clear in general. If V is separable though, it turns 
out that the local definitions of shyness and prevalence are equivalent to the global 
definitions. (On the other hand, it is not clear that these notions are the same in 
spaces such as L°° and l°°.) 

Fact 5. All shy sets are locally shy. If V is separable, all locally shy subsets of V 
are shy. 

Proof. The first part of this fact is trivial. To verify the second part, recall that 
by the Lindclof theorem Q , if V is a separable metric space then every open cover 
of V has a countable subcover. Given a locally shy set Scl', the neighborhoods 
whose intersections with S are shy cover V. Hence by taking a countable subcover, 
S can be written as a countable union of shy sets. Thus by Fact W\ S is shy. □ 

If V is finite dimensional, then shyness and local shyness are equivalent by Fact 
pi In this case we can show further that both of these concepts are equivalent to 
having Lebesgue measure zero. 

Fact 6. A set S C 1" is shy if and only if it has Lebesgue measure zero. 
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Proof. We need only consider Borel sets, because every Lebesgue measure zero set 
is contained in a Borel set with measure zero. If a Borel set S has Lebesgue measure 
zero, then Lebesgue measure is transverse to S, and S is shy. On the other hand, if 
a Borel set S is shy, then by Fact there is a finite measure /i which is transverse 
to S. Let v be Lebesgue measure. Though v is not finite, it is a-finite, so by the 
Tonelli theorem B we have (as in the equation preceding Fact ra) that 

0=/ n(S - y) dv{y) = I v(S-x)dfi(x)=ii(R n )v(S). 

In other words, S has Lebesgue measure zero. □ 

Fact @ implies that in R", Lebesgue measure is a best possible candidate to 
be transverse to a given Borel set. As we mentioned earlier, when looking for a 
transverse measure in an infinite-dimensional space, a useful type of measure to try 
is Lebesgue measure supported on some finite-dimensional subspace. 

Definition 6. We call a finite-dimensional subspace P C V a probe for a set T C V 
if Lebesgue measure supported on P is transverse to a Borel set which contains the 
complement of T . 

Then a sufficient (but not necessary) condition for T to be prevalent is for it to 
have a probe. One advantage of using probes is that a single probe can often be 
used to show that a given property is prevalent on many different function spaces 
by applying the following simple fact. 

Fact 7. If n is transverse to S <ZV and the support of fi is contained in a subspace 
W C V, then SOW is a shy subset of W. 

Next we use one-dimensional probes to show that all compact subsets of an 
infinite-dimensional space are shy. We prove in fact that given a compact set S C V, 
there are one-dimensional subspaces L for which every translate of L intersects S 
in at most one point. To do this we show that a residual set of vectors in V span 
one-dimensional subspaces L with the above property. Here then is an application 
of the fact that a residual set is nonempty. 

Fact 8. If V is infinite dimensional, all compact subsets of V are shy. 

Proof. We assume V is a real vector space; the proof is nearly identical for a complex 
vector space. Let Sc^bea compact set, and define the function f : Rx S x S ^ V 

by 

f(a,x,y) = a(x-y). 

If a vector v s V is not in the range of /, then v spans a line L such that every 
translate of L meets S in at most one point; in particular, L is a probe for the 
complement of S. We then need only show that the range of / is not all of V; 
we show in fact that it is a first category set. For each positive integer TV, the set 
[-N, N] x S x S is compact, and hence so is its image under /. Thus the range of 
/ is a countable union of compact sets. Since V is infinite dimensional, a compact 
set in V has no interior (see p. 23 of p9|), and is then nowhere dense (because it is 
closed). Therefore the range of / is of first category as claimed. □ 
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3. Applications of prevalence 

From now on, when we say "almost every" element of V satisfies a given property, 
we mean that the subset of V on which the property holds is prevalent. Given this 
terminology, the eight numbered statements from the introduction can be proved 
by constructing appropriate probes (see Definition m . 

Proposition 1. Almost every function f : [0, f ] — > K in L 1 satisfies J f(x) dx ^ 
0. 

A probe for Proposition |l] is the one-dimensional space of all constant functions. 
Notice that this probe is contained in C k for < k < oo, so the above property 
also holds for almost every / in C k . Similar remarks can be made about most of 
the results below. 

Proposition 2. For 1 < p < oo, almost every sequence {a>i}°l 1 in l p has the 
property that Yl^—i di diverges. 

For Proposition @, the one-dimensional space spanned by the element {l/*}^ G 
V is a probe for each 1 < p < oo. 

The third statement in the introduction can be proved using the space of constant 
functions as a probe; this follows immediately from the Sard theorem pq| . Here we 
state a more general result, which uses a higher-dimensional probe. We write /W 
for the ith derivative of /. 

Proposition 3. Let k be a positive integer. Almost every C k function f: R — > R 
has the property that at each i£l, at most one of {f^'{x) : < i < k} is zero. 

The space of polynomials of degree < k is a probe for Proposition 13, as we will 
prove in the next section. By Fact W\ Proposition has the following corollary. 

Proposition 3'. Almost every C°° function f: R — ► R has the property that at 
each x £ R, at most one of {f^' (x) : i > 0} is zero. 

Because the dimension of the probe used to prove Proposition y goes to infinity 
as k — > oo, it is not clear whether Proposition |3J can be proved directly using a 
probe. 

Proposition 4. Almost every continuous function f: [0, 1] — ► R is nowhere differ- 
entiate. 

Proposition H requires a two-dimensional probe. A one-dimensional probe would 
be spanned by a continuous function g with the property that for all continuous 
/: [0, 1] — v R, the function / + Xg is nowhere differentiable for almost every A 6 R. 
But if f(x) — —xg(x), then f + Xg is differentiable at x = X for every A between 
and 1. The proof of Proposition uses a probe spanned by a pair of functions g 
and h for which Xg + fih is nowhere differentiable for every (A, /j,) € R 2 aside from 
the origin Q. 

Next we state a prevalence version of the Whitney Embedding Theorem. 

Proposition 5. Let A be a compact C 1 manifold of dimension d contained in R". 
For 1 < k < oo, almost every C k function f: R" — » R 2d+1 is an embedding of A. 
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The probe used in the proof of Proposition @ is the space of linear functions from 
R™ to R 2d+1 . Whitney Q showed that a residual subset of the C k functions from 
R™ to R 2d+1 are embeddings of A. This result was preceded by a topological version 
due to Menger in 1926 (see p. 56 of |10|), which states that for a compact space A 
of topological dimension d, a residual subset of the continuous functions from A to 
R 2d+1 are one-to-one. Proposition H, and the following generalization to compact 
subsets of R" which need not be manifolds (or even have integer dimension), are 
proved in [£8| . 

Proposition 5'. If A is a compact subset o/R" of box- counting (capacity) dimen- 
sion d, and 1 < k < oo, then almost every C k function f: R" — > R m is one-to-one 
on A, provided that m > 2d. 

Our next proposition concerns the preservation of Hausdorff dimension under 
smooth transformations. Once again the probe is the space of all linear functions 
from R" to R m ; see [§7| for a proof. 

Proposition 6. If A is a compact subset o/R" of Hausdorff dimension d, and 1 < 
k < oo, then for almost every C k function f: R™ — > M m the Hausdorff dimension 
of f(A) is d, provided that m> d. 

Remark . It is interesting that Proposition |5j fails for Hausdorff dimension (see [£8| ) i 
and Proposition || fails for box-counting dimension (see [gTJ ) , under any reasonable 
definition of "almost every" . 

We now present a result about the prevalence of hyperbolicity for periodic orbits 
of maps. We say that a period p point of a map /: R" — » R™ is hyperbolic if the 
derivative of the pth iterate of F at the point has no eigenvalues (real or complex) 
with absolute value 1. 

Proposition 7. Let p be a positive integer. For 1 < k < oo, almost every C k map 
on R n has the property that all of its periodic points of period p are hyperbolic. 

Proposition U\ is proved in the next section using the space of polynomial func- 
tions of degree at most 2p — 1 as a probe. Proposition [?] and Fact W*\ imply the 
following more elegant result. 

Proposition 7'. For 1 < k < oo, almost every C k map on R™ has the property 
that all of its periodic points are hyperbolic. 

Next consider one-parameter families of dynamical systems. As the parameter 
varies, it is likely that nonhyperbolic points will be encountered, and at such points 
bifurcations (creation or destruction of periodic orbits, or changes in stability of 
orbits) can occur. In general one can expect to prove results of the sort that 
for dynamical systems of a given type, almost every one-parameter family has 
the property that all of its bifurcations are "nondegenerate" in some fashion. A 
complete discussion of such results is beyond the scope of this paper, but we include 
as an example a result about Andronov-Hopf bifurcations for flows in the plane. 
For flows (as opposed to maps), a fixed point is hyperbolic if the linear part of the 
vector field at the fixed point has no eigenvalues on the imaginary axis. Generally, 
a zero eigenvalue results in a saddle-node bifurcation and a pair of nonzero, pure 
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imaginary eigenvalues results in an Andronov-Hopf bifurcation; see M for details. 
The following proposition is proved in the next section. 

Proposition 8. For 4 < k < oo, almost every C k one-parameter family of vec- 
tor fields f(p,x): R x I 2 -t B 2 has the property that whenever f(/j,o,Xo) = and 
D x f(fiQ,xo) has nonzero, pure imaginary eigenvalues, the flow x = f((A,x) un- 
dergoes a nondegenerate Andronov-Hopf bifurcation in the sense that the following 
conditions hold in a neighborhood U of (fio,xo)'- 

(i) The fixed points in U form a curve (fi,x(n)), where x(n) is C k . 
(ii) The point ()j,,x(p)) is attracting when /i is on one side of no and repelling 

when fi is on the other side. 
(iii) There is a C k ~ 2 surface 7 of periodic orbits mlxl 2 which has a quadratic 
tangency with the plane fi = fiQ. The periodic orbits are attracting if the 
fixed points for the same parameter values are repelling, and are repelling 
if the corresponding fixed points are attracting. 

4. Transversality and prevalence 

The proofs of Propositions [5| [?|, and || are based on the idea of "transversality" , 
which we will discuss now in the context of functions from one Euclidean space to 
another. Given 1 < k < oo and < d < oo, we call M C R™ a C k manifold of 
dimension d if for all x £ M there is an open neighborhood U C R™ of x and a C k 
diffeomorphism <p: U -*• V C R" such that <p{M C\U) = {R d x {0}) C\V. The tangent 
space to M at x, denoted by T X M, is defined to be the inverse image of R d x {0} 
under Dip(x). Notice that an open set in R™ is a C°° manifold of dimension n, with 
tangent space 1" at every point. 

Definition 7. Let A C R" and Z C R m be manifolds. We say that a C 1 function 
F: A — > R m is transversal to Z if whenever F(x) £ Z, the spaces DF(x)(T x A) and 
T F{x) Z spanM™. 

Remark. If DF(x) maps T X A onto R m for all igi, then F is transversal to every 
manifold in R m ; in this case we say that F is a submersion. 

In our applications A is always an open set in R n , so the results below are 
stated only for this case, though they remain valid for functions whose domains are 
sufficiently smooth manifolds. A basic result is the following (see M for a proof). 

Theorem 1 (Parametric Transversality Theorem) . Let B c R 9 and A C R™ be 
open sets. Let F: B x A — > W n be C k , and let Z be a C k manifold of dimension 
d in R m , where k > max(n + d — m, 0). If F is transversal to Z , then for almost 
every A G B , the function F(X, •) : A — * R m is transversal to Z . 

Notice that if F: A — * R m is transversal to Z C R m and the codimension of Z 
(that is, m minus the dimension of Z) is greater than the dimension of A, then 
F(A) cannot intersect Z. This observation is the basis for the following general 
scheme for proving results like Propositions 0, H, and @. To show that almost every 
/ in a space such as C fe (R") has a given property, construct a function F consisting 
of the derivatives of / up to a certain order, and let Z be a manifold defined by 



7 The surface is proved to be C k 2 in jL5fj - However, we suspect that this surface can actually 
be shown to be C* -1 , in which case this proposition applies to C 3 vector fields also. 
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a set of n + 1 conditions which F must satisfy at some point in R" in order for / 
not to have the desired property. By an appropriate generalization of Theorem [H, 
it will follow that for almost every /, there is no point in R n at which F satisfies 
the undesirable conditions. 

Let us formalize the above procedure. 

Definition 8. Let A C R" be open, and let /: A -> R m be C l . For k < I, we 
define the k-jet of / at x, denoted j k f(x), to be the ordered pair consisting of x 
and the degree k Taylor polynomial of / at x. Then j k f is a C l ~ k function from 
A to a space J fe (E n ,R m ) = K™ x P k {W\W n ), where P fe (R n ,R m ) is the space of 
polynomials of degree < k from R™ to R m . We write 

j k f(x) = (x,f(x),Df(x),...,D k f(x)), 

where the coordinates (f(x),Df(x),... ,D k f(x)) represent the (unique) polyno- 
mial in P fc (R™,R m ) with the same derivatives up to order k as / at x. 

Remark. We will later write J fc (R n ,R m ) = J k ~ l (R n ,R m ) x P fe (R™,R m ), where 
P fc (R™,R m ) can be thought of as the space of homogeneous degree k polynomials 
from R n to R m . More precisely, j k f{x) can be decomposed into (j k ~ 1 f(x), D k f(x)), 
where D k f(x) represents a degree k polynomial which is homogeneous in a coordi- 
nate system based at x. 

The following is an example of the type of result we need; it is a prevalence 
version of a result previously formulated in terms of genericity |8[ . 

Theorem 1' (Jet Transvcrsality Theorem). Let A C R™ be open and let Z be a 
C r manifold in J fc (R",R m ) with codimension c, where r > max(7i — c, 0). For 
k + max(n — c, 0) < I < oo, almost every C l function f : A — > R m has the property 
that j k f is transversal to Z . 

Proof. Let P = P fe (R™, R m ), thinking of P for now as a subspace of the C l functions 
from A to R m . We claim that P is a probe (see Definition m for the above property. 
ForpeP, let f p (x) = f(x)+p(x), and define the function F: P x A -> J fe (R",R m ) 
by F(p, x) — j k f P (x). Notice that F is a submersion, because the first n coordinates 
of F are just x, and for a given x the remaining coordinates of F act as a translation 
(by the Taylor polynomial of / at x) on P. In particular, F is transversal to Z. Then 
by Theorem II], for almost every p e P, the function F{p, ■) = j k f p is transversal to 
Z, and therefore P is a probe as claimed. □ 

A special case of Theorem [F] is the following prevalence version of the Thorn 
Transversality Theorem. 

Corollary 1". Let A C R™ be open and let Z be a C r manifold in R m with codi- 
mension c, where r > max(n, — c, 0). For max(n — c, 0) < k < oo, almost every C k 
function f:A—> R m is transversal to Z . 

Propositions ||, [7], and || can be proved using Theorems [j] and |lj, except that we 
would then have to assume in Proposition [| that / is C k+1 and in Proposition [?] 
that the map is C 2 . Instead we use the following results, which do not require those 
additional assumptions and also allow us to avoid determining the entire manifold 
structure of Z. 
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Definition 9. We say that a set S is a zero set in a manifold M of dimension d if 
S d M and for every x £ M there is a neighborhood Z7 of x and a diffcomorphism <p 
on [/ which takes M H £7 to an open set in R d and for which (p(S (~l 17) has Lebesgue 
measure zero in M. d . 

Remark. Since sets of Lebesgue measure zero are preserved under diffeomorphism, 
the particular choice of <p in Definition |9| does not matter; that is, a zero set in M 
has Lebesgue measure zero with respect to all local C 1 coordinate systems on M. 
We will need a Fubini-like result for zero sets of manifolds which allows us to 
prove that a Borel set is a zero set in M by showing that it is a zero set on the 
leaves of an appropriate foliation of M. See J22| for a general result of this type; 
for our purposes we need only the following simple lemma, which follows directly 
from the Fubini theorem. 

Lemma 2. Let M be a manifold of dimension d, and let {M a } be a partition of 
M into manifolds of dimension d! < d with the following property : every x £ M 
has a neighborhood U and a diffeomorphism (p on U which maps M C\U to an open 
set in M. d and which maps those M a which intersect U into parallel hyperplanes of 
dimension d' . If Z is a Borel set in M and Z (~l M a is a zero set in M a for every 
a, then Z is a zero set in M . 

Remark. The hypotheses of Lemma |2] are satisfied if M can be written as Ml x M 2 
with Mi, M 2 manifolds, and the partition of M consists of all manifolds of the form 
{x} x M2 with x £ Mi; this will usually be the case when we apply Lemma |[ 
We now present measure-theoretic analogues to Theorems p] and E71 

Lemma 3 (Measure Transversality Lemma). Let B C M. q and A C R™ be open 
sets, with points in B denoted by X and points in A denoted by x. Let F: B x 
A — » R m x R s be a continuous function with components G: B x A — ► R m and 
H: B x A —>■ R s . Assume that the derivatives D\G, D X G, and D\H exist and are 
continuous at every point of B x A {but D X H need not exist). Let M be a manifold 
in R m with codimension n, and assume that for all x € A and all 5 e I s , the 
function F(-,x) is transversal to M x {y}. Let Z be a zero set in M x R s . Then 
for almost every AeB, there is no x £ A for which F(X, x) € Z. 

Remark. The transversality hypothesis of Lemma [3| is automatically satisfied if 
D\F has full rank at every point of B x A (that is, if F(-,x) is a submersion for 
every x £ A). 

Lemma will be proved at the end of this section. Notice that in the case that 
F is C 1 and Z is a manifold with codimension greater than n, Lemma S is a special 
case of Theorem EL In much the same way as Theorem E] followed from Theorem 
EJ, the next theorem follows from Lemma H. 

Theorem 3' (Measure Jet Transversality Theorem). Assume k > 1. Let A C R™ 
be open and let M be a manifold in J fe_1 (R n , R m ) with codimension n. Let it be the 
projection from J k ~ 1 (M n ,M m ) onto its first n coordinates, and assume that it\m is 
a submersion. Let Z be a zero set in M X P k (W l ,W m ) (see the remark following 
Definition Kh. Then for k < I < oo, almost every C l function f : A — > R m has the 
property that the image of A under j k f does not intersect Z. 



PREVALENCE: A TRANSLATION-INVARIANT 15 

Remark. In our applications, M will be defined by a set of conditions that / and its 
derivatives must satisfy at some point x. When these conditions do not explicitly 
depend on x, the hypothesis that tt\m be a submersion is trivially satisfied. 

Proof. The proof is the same as for Theorem p/L except that we must verify that 
F(p,x) — j k (f(x) +p(x)), where p 6 P fe (R™,R m ), satisfies the transversality con- 
dition of Lemma 0. Given y s P fc (R™, R m ), we have by hypothesis that under pro- 
jection onto the first n coordinates in J fc (R n ,R m ), the tangent space to M x {y} 
at any point projects onto all of R™. The remaining coordinates in J fc (R™,R m ) 
are just P fe (K™,R m ), and when composed with projection onto the latter space, 
F(-,x) is just a translation (and hence a submersion) for every x. Thus F(-,x) is 
transversal to M x {y}, and the hypotheses of Lemma || are satisfied. D 

Theorem pi says, roughly speaking, that given n "codimension one" conditions 
on the (k — l)-jet of / (these conditions can depend on x, but none can depend 
only on x) and an additional "measure zero" condition on the fc-jet of /, almost 
every C k function / on a given set in R n does not satisfy all n+ 1 conditions at any 
point in its domain. We now use Lemma and Theorem m to prove Propositions 
| @, and | 

Proof of Proposition 0. We will show for each pair (ii,^) with < ii < ii < k 
that almost every / in C fe (R) has the property that f( ll \x) and f^ l2 \x) are never 
both zero. Let M be the manifold in J fe - 1 (R,M) defined by f^ = 0. Then M 
has codimension 1, and the set Z C J fe (K,R) defined by f^ 1 ^ — f^ — is a zero 
set in M x P fc (R,R). Therefore by Theorem |], almost every / in C*(R) has the 
property that j k f(x) is not in Z for any x € R, which is exactly what we wanted 
to prove. □ 

Proof of Proposition [7|. We first prove the proposition for fixed points using The- 
orem ^j. Let M be the manifold in J°(R ra ,R") defined by /(x) = x; then x is a 
fixed point of / if and only if j°f(x) lies in M. Notice that M has codimension n, 
and projection onto the first n coordinates is a submersion on M. Let Z be the set 
of 1-jets in M x P 1 (R™,R n ) for which Df has an eigenvalue with absolute value 
1. Then if / has a nonhyperbolic fixed point, j 1 f(x) must lie in Z for some x. We 
will be done once we show that Z is a zero set in M x P 1 (R™,R"). By Lemma 
|2[ we need only show that the set S oi n x n matrices with an eigenvalue on the 
unit circle has measure zero (in the space of n x n matrices, which is isomorphic 
to P 1 (R",R™)). Observe that every ray from the origin meets S in at most n 
points, because multiplying every entry of a matrix by a constant factor multiplies 
its eigenvalues by the same factor. Hence Z is a zero set as claimed. 

For orbits of period p > 1, the condition for nonhyperbolicity depends on the 
1-jet of / at p different points. Thus to apply here, Theorem [3j would have to be 
generalized to p-tuples of jets. Rather than do this in general, we will prove Propo- 
sition |7] directly from Lemma [3| Let us determine what conditions are necessary 
for a set of polynomial functions g\, g2, ■ ■ ■ ,g q : R" — > R™ to span a probe. For a 
given C 1 function /: R™ -> R" and XeW, let 

9 

fx = f + ^2^9i- 

i=l 
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We must show that for almost every A, all periodic points of f\ with period p are 
hyperbolic. 

Let Xi,X2, ■ ■ ■ ,x p denote elements of R™, and let A C R np be the set of all 
(xi, X2, ■ ■ ■ , x p ) for which Xi ^ Xj when i ^ j. Consider the function F = (G, H), 
where G: W x A -» R np and H: R 9 x A -v R" 2 ?> are defined by 

G(X;x 1 ,x 2 ,--- ,x p ) = (/\(a;i) -x 2 J\{x 2 ) -x 3 ,... ,f\{x p ) -x x ), 

H(X;x 1 ,x 2 ,... ,x p ) = (Dfx(xi),Dfx(x 2 ),--- ,Df x (x v )). 

(Essentially i* 1 consists of the 1-jets of / at Xi, ■ ■ ■ , x p , except that G projects the 
0-jets onto a subspace.) 

For a given A, if Xi is a point of period p for f\, then there is a corresponding 
point (xi, . . . ,x p ) G A at which G = 0. We then let M — {0} in applying Lemma [| 
If in addition x\ is nonhyperbolic, then the matrix I7i=i Df\(xi) has an eigenvalue 
on the unit circle. That is, if (A; xi, . . . , x p ) lies in the set S given by 

f » A 1 

5 = < (Mi, . . . , M p ) G R™ p : _[ _[ Mi has an eigenvalue on the unit circle > , 
where Mi, . . . , M p denote n x n matrices. As in our previous argument for fixed 

2 

points, S has measure zero because every ray from the origin in 1" p intersects S 
in at most n points. Thus we let Z = {0} x S in applying Lemma 0. 

We will be done if we can show that G and H satisfy the hypotheses of Lemma 
0. Now G and H satisfy the differentiability hypothesis of Lemma because f\ 
is C 1 as a function of x and C°° as a function of A. To verify the transversality 
hypothesis, we will show that for all (A; xi, . . . , x p ) G R 9 x A, the derivative of F 
with respect to A has full rank. Since F is a linear function of A, we simply want 
to show, for every (xi, . . . , x p ) G A, that F is onto as a function of A. Recall that 
/a = / + 2 -^i<7i) an d observe that whether or not F is onto is independent of /. 
We have thus reduced the problem to one of polynomial interpolation; we need only 
show there exists a finite-dimensional vector space P of polynomial functions from 
R™ — > R™ such that for any p distinct points x% , . . . ,x p G R™ and any prescribed 
values for the 1-jet of a function at the p points, there exists a function in P whose 
1-jet takes on the prescribed values at the prescribed points. 

We claim that the polynomials of degree at most 2p—l have the above interpo- 
lation property. We are referring to polynomial functions from R" — > R™, but the 
interpolation can be done separately for each coordinate in the range, so for simplic- 
ity we consider polynomials from R™ — > R. Given distinct points Xi,... ,x p G R", 
consider the polynomials 

P,(x) = ft 



xA 2 



1=1 

for j — 1, . . . ,p. Each Pj has degree 1p — 2 and is zero at every Xi except for 
Xj, where it is nonzero. Thus a suitable linear combination of the Pj can take on 
any prescribed values at xi,... ,x p . Next let Pjk(x) be the fcth coordinate of the 
function x i— > Pj{x){x — Xj) for k = 1, . . . , n. Each Pjk has degree 2p — 1, and both 
Pjk and its first partial derivatives are all zero at every Xi, except that the fcth 
partial derivative of Pjk is nonzero at Xj . Then given a linear combination of the 
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Pj which takes on prescribed values at Xi,. .. ,x p , adding a linear combination of 
the Pjk will not change these values, and a suitable linear combination of the Pjk 
can be added to change the first partial derivatives at x\ , . . . , X p to any prescribed 
values. This completes the proof. □ 

Proof of Proposition pL There are two main tasks involved in this proof. First, 
we must formulate conditions on the 3-jet of / which must be satisfied if / has an 
atypical (in the sense of violating one of the conditions in Proposition 0) Andronov- 
Hopf bifurcation. Second, we must show that the set Z of 3-jets which satisfy these 
conditions satisfies the hypotheses of Theorem p\ 

The manifold Al which will contain the 2-jet of every 3-jet in Z consists of those 
2-jets which satisfy the following two conditions: 

(a) / = 0. 

(b) D x f has zero trace and positive determinant. 

(Of course, these conditions really depend only on the 1-jet.) Condition (b) is 
equivalent to the condition that D x f has nonzero, pure imaginary eigenvalues. 
Notice that condition (a) defines a codimension 2 manifold, and adding condition 

(b) makes M have codimension 3. 

Condition (i) of Proposition H follows immediately from the implicit function 
theorem, since the determinant of D x f is nonzero at the bifurcation point (/iq, xq). 
For condition (ii) of Proposition g to hold it suffices that the eigenvalues of D x f 
at the fixed point (/j,,x(fj,)) have negative real parts for \x on one side of /iq and 
positive reals parts for /i on the other side. Since the eigenvalues of D x f are complex 
conjugates in a neighborhood of (/io,xo), each one has real part equal to half the 
trace of D x f. Thus if condition (ii) of Proposition fails, the following condition 
must hold: 

(c) The trace of D x f(fj,,x(p)) has ^-derivative zero at //o- 

The derivative of this trace depends on the 2-jet of / at (/zo,xo) and on x'([io), 
which in turn depends on the 1-jet of /. 

We wish to show that the set of 2-jets in M for which condition (c) holds is a 
zero set in M. Since M depends only on the 1-jet of /, by Lemma it suffices to 
fix the 1-jet and show that as the second derivatives in the 2-jet vary, condition 

(c) fails almost everywhere. Notice that fixing the 1-jet also fixes x'(no). Let the 
coordinates of / be (g, h) and the coordinates of x be (y, z). Then condition (c) 
can be written as 

9/iy + V'{^)9yy + z '{^)9yz + h^ z + y'(fi )h yz + z'(fJ, )h zz = 0, 

where the partial derivatives are evaluated at (/zo^o)- As the second derivatives 
vary over all real numbers, the above equation holds only on a set of measure zero 
(a codimension 1 subspace, in fact). 

We have shown that condition (c) holds only on a zero set in M. By another 
application of Lemma g, the set of 3-jets which satisfy condition (c) is a zero set in 
M x P 3 (R 2 ,R 2 ). 

If conditions (a) and (b) hold while (c) fails, then there is a condition (d) that 
the 3-jet of / must satisfy in order for condition (iii) of Proposition || to fail. If 
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coordinates (u, v) are chosen in such a way that 

D x f{/J. ,x Q ) - 



-u> 

LU 



(where u> is the square root of the determinant of D x f), and g and h are the 
components of / in this coordinate system (this is different from the definition of 
g and h above), then condition (d) can be written as 

^yguuu i guw > ri uuv -r iivvu) > guv\guu > gw) 
riuv\"'uu i i^vv) guu^uu i gw^vv o. 

See |15|| for a detailed derivation of this condition, or J6| for a more expository 
discussion of this problem. 

Notice that given condition (b), D x f can be put into antisymmetric form by 
a linear change of coordinates depending only on the 1-jet of / at (fio,xo), and 
further lo is nonzero and depends only on the 1-jet of /. Writing condition (d) in 
terms of the original coordinates would be tedious; instead we employ Lemma |2j 
again by fixing the 2-jet of / and letting its third derivatives vary. With the 2-jet 
fixed, the above change of coordinates is fixed, and induces a change of coordinates 
on the space P 3 (R 2 ,R 2 ). In terms of the new coordinates, condition (d) determines 
a codimension 1 hyperplanc in P 3 (R 2 ,R 2 ), and in particular the set on which it is 
satisfied has measure zero. Therefore by Lemma g, the set of all 3-jets which satisfy 
condition (d) is a zero set in M x P 3 (R 2 ,R 2 ). 

To summarize, we have shown that in order for the conditions given in Propo- 
sition |8J to fail for a given one-parameter family of vector fields /, there must be 
a point (/Lto,£o) at which conditions (a), (b), and at least one of (c) and (d) hold. 
We have shown that the manifold M C J 2 (R 2 ,R 2 ) defined by conditions (a) and 
(b) satisfies the hypotheses of Theorem |], and that the set Z C M x P 3 (R 2 ,R 2 ) 
on which at least one of conditions (c) and (d) holds is a zero set in this mani- 
fold. Therefore by Theorem m, for almost every / in C the conditions given in 
Proposition g hold. □ 

Proof of Lemma @. We assume without loss of generality that Z is a Borel set; then 
so is F~ 1 {Z). Let m: R 9 x R rl — > R 9 be projection onto the first q coordinates. We 
wish to show that tti(F~ 1 (Z)) has measure zero. Let L = G^ 1 (M), and for x G A 
let L x — L n (R 9 x {x}) be the "x-slice" of L. By the transversality hypothesis, 
G(-,x) is transversal to M, and thus Lcl'x R™ and L x C R 9 are manifolds with 
the same codimension, n, as M C R™ 1 (see p. 28 of M). 

Since L has dimension q, away from its critical points tt\\l is locally a diffco- 
morphism. We will show that F _1 (Z) is a zero set in L; then since zero sets map 
to zero sets under diffeomorphisms, tti(F^ 1 (Z)) consists of a zero set plus possibly 
some critical values of tt\\l- By the Sard theorem pq| , the critical values of tt\\l 
have measure zero, and hence tti(F~ 1 (Z)) has measure zero as desired. 

To show that F~ l {Z) is a zero set in L, we first show for all x £ A that F^ 1 (Z)C\ 
L x is a zero set in L x . Since F(-,x) is transversal to M x {y} for all y G R s , and 
the tangent space T\L X is the inverse image of Tq(\ , x )M under D\G, and both 
tangent spaces have the same codimension, it follows that D\F maps T\L X onto 
Tg(\,x)M x R s for all (A,x) e B x A. In other words, F(-,x) is a submersion 



PREVALENCE: A TRANSLATION-INVARIANT 19 

from L x to M x M s for all x G A. Since Z is a zero set in M x I s , its preimage 
F^ 1 (Z) n La; is a zero set in L x as claimed. 

It remains only to show that the partition {L x } of L satisfies the hypotheses of 
Lemma 0. Let W2' K. 9 x K n — ► R n be projection onto the last n coordinates. For 
each (A, x) G L, the kernel of ~ki in Tc\ x \L is just T\L X . Since the former space has 
dimension q and the latter space has dimension q — n, it follows that 7T2 has rank 
n on Tt\_ x \L. Thus ~K2\l is a submersion, which implies (see p. 20 of H) that near 
every point in L there is a local C 1 coordinate system on L whose last n coordinates 
are the same as those of x. The slices L x of L are parallel hyperplanes in such a 
coordinate system, and therefore the hypotheses of Lemma M are satisfied. □ 

5. Extensions of prevalence 

In this article we have proposed sufficient conditions for a property to be said 
to be true "almost everywhere" , in a measure-theoretic sense, on complete metric 
linear spaces. In other contexts more general definitions may be appropriate. For 
instance, the concepts of shyness and prevalence can be extended from vector spaces 
to larger classes of topological groups |17| ■ 

We have concentrated thusfar on extending the notions of "measure zero" and 
"almost every" to infinite-dimensional spaces. We now briefly consider some ways 
to characterize sets which are neither shy nor prevalent in an infinite-dimensional 
vector space V. 

Definition 10. Let P be the set of compactly supported probability measures on 
the Borel sets of V. The lower density p~(S) of a Borel set S C V is defined to be 

p~(S) = sup inf p(S + v). 

nepvev 

The upper density p + (S) is given by 

p + (S) = inf sup p(S + v). 

If p~(S) = p + (S), then we call this number the relative prevalence of S. 
One can show that for all p,, v G P, 

inf fi(S + v ) < inf \i * v(S + v) < sup p * v{S + v) < sup v(S + v), 

vev vgv v£V v( z V 

and thus < p~(S) < p + (S) < 1 for all Borel sets S. It follows that a shy set has 
relative prevalence zero and a prevalent set has relative prevalence one. However, 
sets with relative prevalence zero need not be shy; all bounded sets have relative 
prevalence zero, for example. 

In M n , having positive lower density is a much stronger condition on a set than 
having positive Lebesgue measure. The following weaker conditions give a closer 
analogue to positive measure. 

Definition 11. A measure p is said to observe a Borel set S G V if p is finite and 
p(S + v) > for all v G V. A Borel set S G V is called observable if there is a 
measure which observes S, and is called substantial if it is observed by a compactly 
supported measure. More generally, a subset of V is observable (resp. substantial) 
if it contains an observable (resp. substantial) Borel set. 
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Every set with positive lower density is then substantial, and every substantial 
set is observable. As in Fact 0, if /i observes a Borel set S then so does /a* v for any 
finite measure v. It follows that an observable set is not shy. In M. n , it follows as in 
Fact o that a set is observable if and only if it contains a set of positive Lebesgue 
measure. In a separable space every open set is observable; given a countable dense 
sequence {x n }, the measure consisting of a mass of magnitude 2~ n at each x n 
observes each open set. 
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